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1 Abstract 


In this thesis, which is supervised by Dr. David Penman, we examine 
random interval graphs. Recall that such a graph is dehned by letting 
Xi,... Xn, Yi,.. .Yn he 2n independent random variables, with uniform dis¬ 
tribution on [0,1]. We then say that the fth of the n vertices is the interval 
[Xi, Yi\ if Xi < Yi and the interval [Yi, Xi] if Yi < Xi. We then say that two 
vertices are adjacent if and only if the corresponding intervals intersect. 

We recall from our MA902 essay that fact that in such a graph, each 
edge arises with probability 2/3, and use this fact to obtain estimates of the 
number of edges. Next, we turn to how these edges are spread out, seeing 
that (for example) the range of degrees for the vertices is much larger than 
classically, by use of an interesting geometrical lemma. We further investigate 
the maximum degree, showing it is always very close to the maximum possible 
value (n—1), and the striking result that it is equal to (n —1) with probability 
exactly 2/3. We also recall a result on the minimum degree, and contrast 
all these results with the much narrower range of values obtained in the 
alternative ‘comparable’ model G{n, 2/3) (dehned later). 

We then study clique numbers, chromatic numbers and independence 
numbers in the Random Interval Graphs, presenting (for example) a result 
on independence numbers which is proved by considering the largest chain 
in the associated interval order. 

Last, we make some brief remarks about other ways to dehne random 
interval graphs, and extensions of random interval graphs, including random 
dot product graphs and other ways to dehne random interval graphs. We 
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also discuss some areas these ideas should be usable in. We close with a 
summary and some comments. 
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3 Basic results 


3.1 Definitions 

We recall here the dehnitions of interval graph, and random interval graph, 
from our essay j2]. 

Definition 3.1 Let a graph G have n vertices {l,2,...n}. To create an 
Interval Graph, to each vertex i we assign a finite interval fiof the real line. 
This transformation yields n intervals of the real line. We then say that two 
different vertices of G are adjacent, in the initial graph, if the corresponding 
intervals have non-empty intersection. That is, i ^ j hn Ij ^ 0. 

Definition 3.2 A Random Interval Graph is formed as follows. Suppose the 
vertex set is {1,2. . .n}. Let 2n independent and identically distributed con¬ 
tinuous random variables, Xi, X 2 ... Xn and Yi, Y 2 ... Yn, from the uniform 
distribution in [0,1], be given. The interval R will be [Xi,Yj\, if X^ < Yi or 
\Yi,Xf\, when X.i > Y,. (The case where any two random variables are egual 
has probability 0, by properties of continuous distributions). We then say that 
the random interval graph is the interval graph formed for these vertices from 
the intervals R. 

We use An to denote the set of all possible random interval graphs on n 
vertices. 

The main aim of this essay will be to prove various basic properties of these 
graphs, based on the two papers j9] by E. R. Scheinerman and |3]. We will 
Erst show some equivalent formulations of the model, which will be useful in 
proving theorems, and will then prove some results about various aspects of 
the graphs. 
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3.2 An Equivalent model 


In [9] it is observed that we do not have to use the uniform random variables 
Xi and Yi above. He observed that it is enough to suppose that the intervals 
have as their endpoints the numbers 1,2... 2n in some random order, with all 
the (2n)! possible orderings equally likely. The reason why this is equivalent 
is that, as we observed when defining random interval graphs, the probabil¬ 
ity that two of the Xi and Yj are equal is zero, so they can be any set of 
unequal numbers. Also, because the Xi and YJ are all independent, all {2n)\ 
possible orderings of them have the same probability. (Independence implies 
exchangeability - that is, the property that the probability that the XjS and 
YiS take certain values is the same as the probability that the image of them 
all under some permutation in S 2 n, the symmetric group on 2n letters, take 
these values). This concludes a proof of the following result, which will be 
used later in the proof that there is vertex of degree (n — 1) with probability 
2/3. 

Lemma 3.1 An equivalent definition of random interval graphs is to say 
they have vertex set {1, 2 ... n} and that the intervals attached to the vertices 
have as their 2n endpoints the numbers 1, 2... 2n in some order, the order 
being chosen uniformly at random so that all {2n)\ possible orders are equally 
likely. 

Proof. See above. 
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3.3 Edges in interval graphs 

The simplest question about a graph is how many edges it has. By Theorem 
10 in [2], we know that the probability of any particular edge arising in a 
random interval graph is 2/3. We hrst extend this result to obtain an estimate 
of the total number of edges in a random interval graph, namely that it is 
near to n^{l + o(l))/3. This is the first main result in E. R. Scheinerman’s 
paper j9]. 

It will be helpful to give an overview of the proof. It is easy to show 
that the expected number of edges is exactly n{n — l)/3 using the fact about 
the probability that an individual edge arises is 2/3. What we need to do 
is to show that it is very likely to be very close to this value - the precise 
sense of this will be made clear in the statement of the result. (The result 
is a limit result: many interesting results in probability, e.g. the law of large 
numbers and the Central Limit Theorem, are of this form). In order to show 
that it is very likely to be very close to n{n — l)/3, we shall look at the 
variance of the number of edges and show that it is small compared with 
n{n — l)/3. The way this will be done is by writing the number of edges as 
the sum of indicator variables, one for each edge. Evaluating the variance 
then involves considering the various possible values of the expectation of 
XijXk£ for various possible i, j, k, 1. We shall see that in most cases they are 
independent, so the total variance is really rather small. 

Theorem 3.2 Almost all graphs in have n^/3 + o{n‘^) edges. More for- 


mally, letting X denote the number of edges in the graph, 

Proof. For every pair (i, j) with 1 < i, j < n and i 7 ^ j, let the indicator 
random variable Xij be equal to 1 if i ~ j and be equal to 0 otherwise. (In 
other words, Xij takes the value of 1 , if the intervals corresponding to the 
vertices i and j intersect, so that the edge is present). 

Now let X = Then each edge which is present contributes 1 

to this sum, and those absent contribute 0, thus X counts the total number 
of edges in the graph. 

As we previously saw, P{Xij = 1 ) = 2/3. Because Xij is an indicator 
variable (i.e. takes only the values 1 and 0 ) we have 

00 

^ rP{Xi, = r) = 0 + 1 X = 1) = P{Xij = 1) 

r=0 

Thus 

E(X) = E( y X„) 

l< 2 <j<n 

= y e(a'«) 

l<i<j<n 

= E P(x„ = i) 

l<i<j<n 

n(n — 1 ) 2 

“ 2 3 

n{n — 1) 

“ 3 

To estimate the number more precisely, we need also to have some grip on 
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the variance of X, namely 


Var(X) = E(X=^) - (E(X))2 

= E( E x„ E A'.,) - (h!niil)2 

l<i<j<n l<k<£<n 

(the last term is because we just worked out E(X))! Now we consider various 
cases for i,j, k and i, counting - or at least estimating - carefully how many 
such cases there are. 


Case 1. i,j, k, i are all distinct. As i < j and k < i, there are (^ 2 ) (^” 2 ^^) /2 
possibilities (as the first two are chosen from all n, the second two from the 
remaining (n — 2). Thus we get n{n — l)(n — 2)(n — 3)/4 cases. In each of 
them, 'E{XijXki) = E(Xjj)E(Xfci) = 2/3 x 2/3 = 4/9. This is because the 
two edges are independent, as they have no vertex in common. 


Case 2. There is some overlap between set {i,j} and {k,i}. There are 
however only a small number of these. Indeed since at least two of i,j,k 
and i have to be equal, we are only choosing (at most) three numbers. Thus 
there are at most (^ 3 ^ such cases. In each of these cases, the contribution to 
the sum made by Ei{XijXki) is at most 1 , simply because Xij and Xki are 
< 1 - 

Then, calculating, we get 


Var(X) = E( ^ X,, ^ X^i 

l<i<j<n l<k<i<n 

n{n — l)(n — 2)(n — 3) 4 


,n{n — 1 ). 


< 


- from Case 1 
9 
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+ 


' n' 


X 1 from Case 2 


n(n — l)(n — 2)(?7, — 3) fn^ 

9 ^ la. 


3 ^ 

n^{n — 1 )^ 
9 


nin — l),, , 9(n — 2) 

-[{n - 2)(n - 3) H--- n{n - 1)] 


9 
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The above is, since the two terms in the square bracket involving cancel 
with each other, of the form 


n{n — 1) 
9 


[Cn + D] for suitable constants C, D 


< Erf 


for a suitable constant E. 

Now we use Chebyshev’s inequality, which says that, for any random 
variable X, 

F(|X-E(X)|>e<^^^^^^. 

See |2]. Here we thus deduce that, for any e = cn?, we have 

P(\X-nX)\>cn-^<'?^ 

[cn^Y 

Em? 

~ 

which, for large n, tends to 0. Thus with probability tending to 1, we do 
indeed get (1 + o(l))n^/3 edges. • 

It is perhaps worth noting that it is natural to compare a random interval 
graph with another well-known model of random graphs. This is the so-called 
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Erdos-Renyi model G{n,p) where there are n vertices and each edge arises 
with probability p independently of all other edges. This model is discussed 
in great detail in pp. Clearly the most reasonable such model to compare 
random interval graphs with is G(n, 2/3) as we know that in random interval 
graphs 2/3 is the probability of each edge arising. The above result does have 
a simple analogue for Erdds-Renyi random graphs, namely that an Erdds- 
Renyi random graph has about n^/3 edges. The proof in this case is much 
simpler: Indeed the law of large numbers, m says that the number of edges, 
divided by the total number of possible edges n{n — l)/2, is close to the 
expectation of any one of the indicators, namely 2/3. Thus the number of 
edges is close to n{n — l)/3. The reason why this case is so much easier is 
that the edges in the Erdds-Renyi graph are independent, so that standard 
results like the law of large numbers, m apply to them. (It will emerge later 
that the edges are not independent, when we show that various things in a 
random interval graph usually take very different values from their values in 
G'(n,2/3).) 
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4 Degrees in Random Interval Graphs 


4.1 Introduction 


We now turn our attention to the more detailed distribution of where the 
edges are. We shall consider the degrees of vertices and show that degrees 
are much more spread out that in the Erdos-Renyi model G{n,2/3). Indeed 
in the Erdos-Renyi model G{n, 2/3), we shall see that, for all e > 0, 


lim P 

n^oo 


all vertices have degree between 


2n{l 


and 


2n(l + e)' 


= 1 . 


That is, almost all degrees are about 2n/3. However in random interval 
graphs, we shall see in various ways that degrees are much more ‘spread 
out’. For example, the probability that there is a vertex of degree n — 1 (the 
maximum possible degree) is 2/3. 


4.2 Degrees of Graphs and some results 

We recall hrst a dehnition from |2]. 

Definition 4.1 The degree of a node of a graph is the number of vertices 
which are adjacent to this vertex. If v is a vertex, then the degree of v is 
denoted by deg(v). 

For a graph G, we define A(G) to be the maximum degree: that is, 

= max 

l<i<n 


d{vi). 
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Similarly we define the minimum degree 


5{G) = min d{yi). 

l<i<n 

We saw in the last section that a random interval graph has about u^/3 
edges. Our next task is to show how these edges are spread out. We will see 
a theorem from j9] about the degrees of random interval graphs. 

Theorem 4.1 Let G G and v G V{G). For a known x G [0,1], we have 
for X >1/2 

71 

lim P{d{v) < xn) = 1 — (1 — a:) — 
and for x < 1/2 we have 

Jin^ P{d{v) < xn) = 1 — (1 — x)( 7 r /2 — 2 cos“^[l/\/2 — 2x]) — \/l — 2x 

Note what the theorem means. It says that, for example, taking x to be 
(say) 0.01, the probability that there is a vertex of degree < O.Oln is, in the 
limit as u —)■ oo, strictly positive. This is very different from what happens 
in G(? 7 ,, 2 / 3 ) where, as mentioned earlier, all the degrees are close to 2n/3. 
Similarly it says that the probability that a vertex does not have degree 
< O.OOn - that is, that its degree is at least 0.99n - is, in the limit, a non-zero 
number. So the degrees are indeed much more spread out than in G(n, 2/3). 

The proof will rely on the following geometrical lemma. We give a rather 
detailed proof of the Lemma as no details are provided in Scheinerman’s 
article [9]. Some of the details here were suggested to me by Dr. Penman 
0 , 

Lemma 4.2 For an interval I = [x, z] C [0,1] let the radius of I, p{I) be 
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a? + {1 — hY, where a = min{a;, z} and b = max{a;, z}. Assume that x and 
z are independent, and uniformly distributed on [0,1]. Then, fory <1/2 

P{pV)<y) = Y- 

When y > 1/2, we have that 

<y)= y{^ - 2cos-^[^]) + sj2y-l. 

Proof. The required probability is the probability that p{I) < which 
is the size of the set of points in the square [0,1]^ which are within a distance 
y/y from (at least) one of the points (0,1) or (1,0). Let us consider all the 
possible cases for 0 < y < 1. The equation of the circle centered at (1, 0) is 
{x — 1^) + z"^ = y. In the second circle, centered at point (0,1) we have the 
equation x"^ + {z — 1^) = y. So if they intersect at {x, z), then 

{x — 1)^ + z"^ = x"^ + {z — 1)“^ = y 
x^ - 2x + 1 + z^ = x^ + z'^ - 2z + 1 
^ —2z = —2x ^ z = X 


Thus, these points satisfy; 


{x — lY + x^ = y 
^ 2x^ — 2x + 1 = y 

9 1 — y 

x^ — X ^ -^— = 0 
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For such points to exist, we need that the discriminant is greater than 0. 
Thus, 


(-1)^ - 4 ^^—— > 0 

V ; 2“ 

^ 1 - 2 ( 1 -!/) >0 

^ ( 2 |/ - 1 ) > 0 

1 

y > - 

y _ 2 


Now let consider the two cases for the value of the radius. 


Case 1. Let |/ < |, then y/y < l/\/2 is the radius of the two circles centered 
at either the point (0,1) or (1,0). Because y < these two circles do not 
intersect by the above analysis. Then the area we want is two quarters of 
disjoint circles which have radius equal to y/y. Having in mind that the area 
of a circle with radius y is equal to tt x then the area of a quarter of circle 
is Itt X yjy^ = Itt X y. So the required area is equal to as stated. 


Case 2. y > 1/2. In this case, the circles have intersection and by the 
above we have for x = z, that: 


2 , (l-I/) 

X — X -] --— 


, 1,0 ( 1 - 1 /) 1 
^22 4 


0 

0 
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( 1 ) 

( 2 ) 




( 3 ) 


( 4 ) 


( 5 ) 


In this case, as previously noted, the two circles intersect in two points, A 
and B say. We consider the triangle with vertices (0,1), A and B and the 
triangle with vertices (1, 0), A and B, also the arcs of the two circles between 
A and B. Let 0 be the angle formed at [0,1] and [1, 0] by the two triangles 
(by symmetry, the two angles are the same). The area of each sector of each 
circle is equal to 


0 0y 
71 X y — = — 


Also the area of the triangle is 4 x \CA\ x \CB\ x sin(6*), where C Is the point 
(0,1) and A,B are the intersection points, between the two circles. 


Then the total area of intersection of the two sectors is: 


2[^-\\CA\\CB\s\ne\ 

= 0y-\CA\\CB\AYi0 

To obtain the angle 6*, recall that the vertices are: C = (0,1), B = 



Now = |ACp +|C'i?p — 2|AC||Ci?|cos6'by the cosine rule, |lj. The 


vectors are 



17 












ct= + 

^ 2 4 2’V2 


jd = 


ly 


1 , 1 
4 2’ 


ly 


1 _ 1 ' 

4 2' 

T 1. 

4 2' 


^AB^ = [2f--\r+[2^l-\r 

= 8{l-h = 4y-2 


Similarly, 




2 4 


1 1.0 




and then ICAp = y just by symmetry. Then the cosine rule, |1] becomes 


{Ay-2) = y + y - 2y/y^cose 


Ay — 2 = 2y — 2y cos 9 
2y-Ay+ 2 


cos 6* = 


21 / 


6 = cos ^[—1 H—] 

y 


(Note that, as ?/ > 1/2, —1 + 1/?/ < 1 as required). Then, the formula for 
the area of the intersection of the two quarter-circles is (note that here we 
use cos“^ to mean the inverse function to cos, what many people call arccos: 
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in particular, cos ^(x) does not mean 1/ cos(a;)) 


ycos ^[-1 + ^] - v^v^sin[cos ^(-1 + ^)] 


= 2/cos-^-l + - -2/i 1- -1 + - 

2 / V y 


= ycos ^[-1 +-]- \/y^ - {-y + If 

1, 


= ycos ^[-1 + -] - \/2|/ - 1 


To explain the working in the last paragraph; We used cos^ 6^ + sin^ 0 = 1, 
|1] and also the fact that in our situation the angle 6 is clearly between 0 
and 7r/2 so that both cos and sin are positive, with the result that sin(x) = 
— cos^(x). Thus sin[cos“^(—1 + 2)] is equal to — cos2(cos“^(—1 + ^)) 
which is of course equal to — (—1 + 

What we have just worked out is the formula for the area of the inter¬ 
section of the two quarter-circles. Thus the shaded area in Scheinerman’s 
picture is the areas of the two individual quarter circles minus the area of 
their intersection, which of course is 


2 



'^y -ir 

— - ycos [ 


-1 + -1 + 

y 


] + \/ 2!/-1 


It only remains to conhrm that this formula we have just derived is the same 
as the one given in Scheinerman’s article, namely 
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For this in turn it suffices to show that 


To see this, take cosines of both sides, and recalling the identity cos(2a:) = 
2 cos^(a:) — 1 , [ 1 ] we see the left-hand side is: 


2 cos^(cos —1 


= 21 




— —1 H— 

y 


as required. • 


Proof of Scheinerman’s theorem, (see Theorem 4.2 in [9]: we provide 
some more details). Let v corresponds to vertex 1: no generality is lost by 
this, as no vertex is favored by the set-up. Let Jj be the interval assigned to 
vertex i, for each i = 2.. .n. Then let = 1, if 1 is adjacent to i and be 
equal to 0 otherwise. Thus X = Y ^^=2 ^^e degree of the vertex 1. 

Suppose now that p{Ii) = r is fixed. We claim that then: 

P = P{Ii = l|p(/i) = r) = 1 - 

When we have proved this, it is then obvious that, conditional on p(/i) = r, 
the expectation of X is (n — l)p and its variance (again conditioned on the 
value of p(/i)) is (n — l)p(l — p) since the li are independent of each other 
given 1 1 and the radius. Indeed, if i 7 ^ j, then the two random variables Xi 
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and Yi giving the two endpoints of Jj are independent of Xj and Yj giving 
the endpoints of Ij. 

Thus we can use Chebyshev’s inequality again on the random variable X, 
[8] and deduce that; 


P{\X- 


in 


- Ibl > 




Then, d{l) = np + o{n), under the hypothesis that p(/i) = r. Thus for any 
e > 0 we have: 

1 — o(l) for r < y/1 — X — e 
o(l) for r > ^/T — X + e 

using our formula for the value of p for a given value of r. So now we need to 
remove the conditioning on the value of r, which we do in the usual manner; 

-P(<^(1) < xn) = / P{d{l) < xn\p{Ii) = r)dP{p{Ii) < r) by the law of total probability 

But we have just worked out the distribution function for the probability 
that p{Ii) < in the Lemma. So this is; 

[1 - o{l)]P{p\h) <l-x)+ eO(l) + o(l) 

^P{p\li)<l-x). 

Now the result follows using the Lemma, with y replaced by 1 — x. 


P{d{l) < xn) = 
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Thus the only thing that remains to be proved is that, if p{Ii) = r, 
then the probability that any other interval intersects is 1 — To obtain 
this, recall that in [U], the radius of an interval [x,y] C [0,1] is equal to 
a? + {1 — hY = r, where a = minja:, ?/} and b = ma.x{x,y}. Also, we 
should remind ourselves that the (2n) random points follow the uniform 
distribution in (0,1). Let Ji = [a,b]. We calculate the possibility of the 
existence of i interval, denoted by A = [Xi, Yj\, which does not intersect with 
Ji. To happen this, both Xj, Yi must be smaller than a or both must be 
greater than b. The first possibility is: P{Xi,Yi < a) = since 

they are independent, uniformly distributed random variables in (0,1). The 
second possibility is equal to: 


P{X„Y,>b) 
= P{X, > b)P{Yi > b) 
= {l-P{X,<b)){l-P{Y,<b)) 


= {l-b){l-b) 

= {i-br. 

Thus, the probability of /iH/j = 0, for i = 2, 3 ... n is equal to + (1 — 6)^ = 
r^. Hence the complement probability (Ji and have a non-null intersection) 
is 1 — • 
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4.3 Maximum degree 

In the last section, we saw there is a non-zero probability that a vertex has 
very high degree (e.g. at least 0.99n). In this section, we hrst sharpen this. 
We then give a resnlt from [3] which attempted to prove, but did not 
succeed. That result is, that with probability 2/3, there is a vertex in a 
random interval graph whose degree is n — 1. This is of course the largest 
possible degree any vertex in the graph can have. 

The hrst result (which is Theorem 4.4. in Scheinerman) is easy. 

Theorem 4.3 In a random interval graph, let ujn be any function which 
tends to infinity with n. (One should think of it as doing so very slowly). 
Then 


lim P [a random interval graph has A > n — uj(n)) = 1. 

n—>-oo ' ^ \ J/ 


Proof. Some details of that proof were suggested to me by Dr. Penman 
[6], as Scheinerman’s proof for that Theorem is rather short. Let x = 

Then 


UJr 


P{> n —— + o{uJn) intervals intersect [x, 1 


= 1 — P{> n 


UJr 


+ o(ujn) intervals don’t intersect [x, 1 


= 1 — P(interval doesn’t intersect [x, 1 


-x]) 

-x]) 

^]r 



as the probability a random interval does not intersect [x, 1 — x] is the prob¬ 
ability that both ends are less than x (probability x x x = x^, using the 
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independence of the ends) or that both ends are greater than 1 — x (proba¬ 
bility [1 — (1 — xyf' = x^). The above is 



n 

1 8lS n > ojn when n ^ oo 

Thus at least n—oJn/‘^+o{oJn) intervals intersect [x, 1—x]. Our proof will now 
be complete if we can show that, with probability tending to 1 as n —)■ cxo, 
there is at least one of the random intervals which contains [x, 1 — x], as then 
such an interval will be a vertex of degree > n — u{n). 

To this end, recall the fact that random intervals are independent, as 
all the possible orderings of their endpoints are equally likely. Let Xi be 
an indicator variable, denoting whether the interval Jj = [Ai,Bi] contains 
[x, 1 — x]. Then X = Yh=i Xi is the number of our n random intervals which 
contain [x, 1 — x] and our objective is to show that X > 0 with probability 
tending to 1. To this end we use Chebyshev’s inequality, [8| 

P(|X - E(A')| > () < 

^ P{X = 0) < P{\X - E(X)| > E(X)) 

^ Var{X) 

-jEixW 

Now Xi is a Bernoulli variable and the various Xi are independent of each 
other, simply because the distinct intervals are independent of each other. 
Thus X is binomial, with parameters n and the success probability a. We 
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thus have, by standard results about Binomial random variables, 


n 

E(X) = na and Var(X) = ^ Var{Xi) = na{l — a) 

i=l 


and thus by Chebyshev, j8] 


P{X = 0) < 

na{l — a) 
'tPa? 


VariX) 

(E(^ 

(1 -g) 
na 


Thus if we can show that P{X = 0) tends to zero, this will show that 
P{X > 0) tend to 1. 

We need to calculate a. Recall 


a = P{Xi = 1) = P(a given random interval contains [x, 1 — x]) 


Letting the two (random) ends of the interval be A and B, we have that this 
probability is 

a = P{{A < X and B > 1 — x)} U {A > 1 — x and B < x}) 

= P{A < X and B > 1 — x)) + P{A > 1 — x and B <x) 

as the two events involved are mutually exclusive. This in turn gives 

a = P{A < x)P{B > 1 — x) + P{A > 1 — x)P{B < x}) by independence 
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= x{l — x) + a;(l — x) = 2x{l — x) 



Hence, combining the above results, 


F(X = 0) < ^—- < — 
na na 

1 

n^Jun/n (l - Y^a;„/(2n)) 
1 

nV2^/^(l _ ^ujnl{2n)) 
< - . for large enough n 


using the fact that 


for large enough n. Thus 


1 - 


1 /cJT. 1 

2\n-2 


P{X = 0) < —775 ——)■ 0 as n ^ 00 

and thus with probability tending to 1, X > 0, that is there is some interval 
containing [x, 1 — x] as required. • 


The dual result is the following striking fact. We emphasize that, unlike 
most of the results in this chapter, this has nothing to do with a limit; it is 
an exact result, not depending on the number of vertices. The proof comes 
from |3]. Scheinerman made a lot of effort in his paper to prove a result 
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along these lines, but did not quite succeed. 

Theorem 4.4 Let G be a random interval graph. Then 

P(A(G) = n-l) = | 

Proof. Instead of calculating the moderately difficult expression 

1 — 4n(n — 1) / [ xy{l — — 2xy)"'~‘^ — Idxdy 

Jo Jo 


writers [3] used an efficient combinatorial proof, as we shall see. They take 
random pairs of integers 1 , 2 ,..., 2n. Once the intervals are selected by some 
random pairing of the 2n numbers, they label the endpoints ^(1), 5(1),..., A{n— 
2), B{n — 2) in the following way. Let the endpoints {1,..., n} be at the left 
side and respectively the endpoints {n + 1,..., 2n}, be at the right side. Let 
also A(l) = n and 5(1) is its mate. Suppose that we have assigned through 
A{j), B{j). We label the next endpoints, by the following rules: 

Case 1. If B{j) is on the left side, then let A{j + 1) be the leftmost point 
on the right side that has not yet been labeled. Let B{j + 1) be its mate. 

Case 2. If 5(j) is on the right side, then let A{j + 1) be the rightmost 
point on the left side that has not yet been labeled. Let B{j +1) be its mate. 

Endpoints are being labeled from the center outwards. Then, if A{j) < B{j), 
it is A{j + 1) < 5(j) 
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If A{i) > B{j), it is A{j + 1) > B{j) 
Then, we will either have: 


MJ) < + 1 ) 

or A{j) > A{j + 1) 

Furthermore, if A{j) < B{j), then A{j + 1) < B{j + 1) 

If A{j) > B{j), then A{j + 1) > B{j + 1) 

With this way, starting from the center labeling the endpoints, we deduce 
that either an equal number of points have been assigned in both sides, or 
two more points have been assigned on the left than on the right side. Since 
the last endpoints assigned are A{n — 2) and B{n — 2), from the total number 
of points, which is equal to 2n, there are four remaining points unlabeled, 
namely a < b < c < d. Having a specihc ordering, it is considered all the 
possible ways of pairing them to consist two random intervals, a can be 
matched with b, c or d, with equal probabilities. Thus, we have 3 possible 
cases. We easily observe that in two cases of pairing the remaining points, 
the corresponding intervals intersect and in one case they are disjoint. Let 
now a and b be on left and c and d on the right. If a is paired with c, then the 
random interval [a, c] meets all the others. Also the same happens when a is 
paired with d. This is because, we assumed the points {1, 2... n} lie on left 
and the remaining points {n + 1 ... 2n} lie on the right and by construction of 
labeling them. On the other hand, if a is paired with b, then [a, b] fl [c, d] = 0. 
Suppose that an interval [e, /] intersects all the others. Also let A{j) = e and 
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B{j) = f. In this case, where a and b are on the left, A{j) lies between b and 
c. Thus, [e,/] cannot intersect both [a,b] and [c,d]. Furthermore, consider 
the case where only a is on left. Since [e, /] meets [c, d] we have f > c, hence 
/ = B{j). Again, by construction if a is paired with b, then [a, b] fl [c, d\ = 0. 
On the other hand, in cases where a is paired with c or d the corresponding 
intervals intersect. The probability of pairing a with c or d, in the specihc 
ordering of the four endpoints, which is a<6 <c<d is 2/3. Finally, we see 
that the probability in a family of n random intervals, the maximum degree 
has value n — 1 (i.e the probability that an interval meets all the others) is 
2/3. • 

4.4 Minimum degree 

The previous subsection makes it clear that the maximum degree in a ran¬ 
dom interval graph is much bigger than 2?7,/3, which is roughly the expected 
number of neighbors of each vertex. As we will see at the end of this chapter, 
this is different from ^(n, 2/3) where all degrees are about 2n/3. We now 
say a little about minimum degrees, presenting a Theorem from [9]. We omit 
the proof of this result, as it is moderately difficult. 

Theorem 4.5 Let k he a fixed, non-negative real number and 6 denotes the 
minimum degree of the graph. We have, 

Ji^F((5 < ky/fi) = 1 - exp{-y} 

Note that Theorem 4.1 implies the result below, which is noted in [9]. 
Corollary 4.6 For every e > 0 sufficiently small, almost all interval graphs 
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satisfy 6 < e ■ n and A > (1 — e) • n. 


This result is an immediate corollary of Theorem 4.1. 


4.5 Degrees of vertices in G(n, 2/3) 


In the previous subsections, we studied the minimum and maximum degrees 
of Random Interval Graphs. We now give, for contrast, the result for the 
Erdds-Renyi model, where the probability of an edge arising is constant, 
equal to 2/3 and the edges are independent. Here it will turn out that all 
the degrees are ‘about’ 2n/3. 

Theorem 4.7 In G{n, |) and for e > 0 sufficiently ‘small’, all vertices have 
degree between about (| — e)n and (| + e)n. More precisely, 


lim P{all vertex degrees ^ [(g 


(g + e)«]) 


1 


Some details of this proof were suggested by |B]. Also, for the proof of this 
Theorem, we use the ‘Large Deviations’ Lemma from Scheinerman, [9]. 

Lemma 4.8 If p is constant and e > 0, then 

^ ^-bcpn 

P(\X — np\ > enp) < — - 

y/np 

Where a^, b^ are positive constants, which depend only on e: that is, not on 
n or p. 
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Proof of Theorem 4.7. Let the random variable X denotes the number of 


vertices having degrees not in the interval 

[(^-e)n, (^ + e)n]. 

X is the sum of n independent random variables Xj, z = 1, 2... n where Xi 
is 1 if vertex i has degree not in the stated range and is zero otherwise. We 
again aim to show that X = Yl'i=i X* is 0 with probability tending to one as 
n ^ oo, and to prove this we will use the fact that 

OD OO 

P(X > 0) = ^ P(X = i)<Y, iP{X = i)< E(X). 

i=l i=l 

So we aim to show E(X) —)■ 0, for which in turn it suffices, as 

n n 

E(A') = E(y .Y.) = y E(A'.) 

i=l i=l 

= nE(Xi) as the Xj are identically distributed 

= nP(Xi = 1) 


to show that P(Xi = 1) is o(n). 

But the degree of vertex i in G(n, |) has binomial distribution with pa¬ 
rameters n — 1 and |. Thus, using the Large Deviations Result, [9] 


2 2 

P[degree of vertex i ^ [(- e)n, (—h e)n]] 

3 3 


P(X,) = P(W = 1) X 1 + P(W = 0) X 0 = P(W = 1) 
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E(X,) = P{X, = 1) < 


EiX)=Y.EiX,)< 


a,e ^'3"' 

2n 
3 

-b^n 


n- 


i=l 


And this is indeed o{n) for large n, just because the exponential terms con¬ 
verge more rapidly than polynomial. • 

Remark. In fact the above argument can be sharpened quite substantially: 
a random graph G(n, 2/3) has the property that there is an explicit constant 
C such that 

lim P — cJn log(n) < S{G) < A(G) <‘^ + cJn\og{n) \ = 1. 

The proof is in |T] (but is much harder than the above proof). 
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5 Cliques, independent sets and chromatic num¬ 


bers in Random Interval Graphs 

5.1 Cliques 

In this section we study the clique number uj{G) of a random interval graph 
G. Recalling from [2], the clique number of a graph G, denoted as uj{G) is 
the order of the largest complete subgraph of G. The basic result is from [9], 
Theorem 4.7. 

Theorem 5.1 The clique number of a random interval graph is usually about 
n/2. More precisely, 

mn p(c(G) = I+ o(n)) =1. 

Proof. First note that a maximum clique consists of a family of intervals, 
each pair of which intersect. In our essay, P] we showed that such a family 
of intervals has some point x say, which is in all the intervals. This result is 
known as Helly’s Theorem. 

It is intuitively obvious that the point with the best chance of being in 
several intervals is a: = 1/2: let us be more formal about this now, using the 
argument from |3]. The probability that a random interval [^, R] does not 
contain x is the probability that both X and Y are less than x, which has 
probability or the (exclusive) possibility that both are greater than x, 
which is (1 — thus the probability that it does contain a: is 1 — — (1 — 

x)^ = 2x — 2x^. This is maximized for x = 1/2, as the derivative of 2x — 2x^ 
is 2 — 4x which is zero exactly when x = 1/2, when the probability that an 
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interval contains a; is 2 ■ 1/2 — 2 ■ (1/2)^ = 1/2. In a different sense, since 
the endpoints of the n random intervals follow the the uniform distribution 
in [0,1], then with probability equal to 1 will be distinct. Moreover, the 
mean number of a variable, which follows the uniform distribution is equal 
to As a result, the expected number of intervals is indeed n/2. 

The number of n random intervals containing 1/2 is binomial; Bin(n, 1/2). 
That is because each random interval in (0,1), either it will contain 1/2 with 
probability p say, or it will not contain it, with probability 1 — p. We have 
n independent Bernoulli trials so the number of intervals containing 1/2 is 
indeed a Binomial random variable. This takes a value very close to n/2, as 
we saw above: in particular, it is n/2 + o(n). This gives a lower bound on 
the clique number. 

We now need to show that it is not more than n/2. Consider intervals 
of the form Jj = [i/v?, {i + 1)/v?]. We need the following technical result 
(which Scheinerman calls the ‘medium deviations lemma’ for binomial ran¬ 
dom variables); 


Lemma 5.2 If X = where Xi = 1 with probability p and is zero 

otherwise, and the Xi are independent, then for 


1 < h < min 


np{l — p) (pn)A3'| 

10 ’ 2 J 


then 


P{\X-np\ >h)< 


ijnp{l-p) 

h 


exp(- 


h^ 


2np{l — p) 
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The above lemma comes from [S]. 


How does this help us? Let us work out the probability that more than 
n/2 + o{n) of the n random intervals intersect Jj. (We are aiming to show 
that this does not happen). The probability that a random interval [W, H] 
does not intersect Jj is the probability that both X and Y are less than i/ri^ 
(which is /n^), or that both are bigger than {i + l)/n^ which has probabil¬ 
ity (1 — Thus the success probability (i.e the probability that it does 

intersect /*) is 


i + 1 




= 1 




1 - ^ 

(1 _ hi+h + (i±h!; 






2{l+l) *2 + 






n’ 


Hence the number X of the n random intervals which intersect is binomial 
with n trials and success probability 

2 (^ + 1 ) ^2 (, + 1)2 

rfi n'^ n'^ 


We need to work out which value of i maximizes this expression. Let us treat 
the more general problem of which (continuous) value of x maximizes 

2{x + 1) a;2 (x + 1)2 

/W =-^- 4 - 4 — • 
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Differentiating, we get 


fix) 


2 2x 2(a; + l) 



so the turning point is at f'{x) = 0, 

nV 2 

“ T w ■ 



— 1 


2 


Since f”{x) < 0, this turning point is a maximum. Note that at it we have 


fix) = 


+ 1 {vf — 1 )^ {in? + 1 )^ 


= 1 + 


1 1 
- + 


4n^ 

1 1 


An'^ 

1 1 




2in? An? 


2n? An^ 


1 1 1 

1 ^ 2n^ -I 
” 2 ^ 2n^ 


The point about this number is that it is very close to 1/2. 

We now (rather arbitrarily) consider 0.6. Note that ® < (np)^/^/2 and 
n?-^ < np{l — p)/10 for large enough values of n. The same holds with 
0.6 replaced by 0.61. (This is checked so that we can apply the medium 
deviations lemma in a minute). We have 


P{X > n/2 + n°-®) 
< P(Bin(n, 1/2 + {2n? — l)/r?) > n/2 + n°'®) 
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since the success probability for the latter binomial is the largest possible 
value of the success probability under the constraints. 

This we now apply the medium deviations result to, taking h = ® and 

t = hj^np{l — p) where p = 1/2 + {2rP — l)/n"^. Note that given e > 0, 
p < 1/2 + e for all large enough n. Thus p{l — p) is > (1/2 + e)(l/2 — e) = 
1/4 — e^, and on the other hand is < 1/4. Thus t is between ^l/A — 
and Thus 

1 

< P(Bin(n, 1/2) > n/2 + n°'®) < - exp(-) 

t 2 

and this is, by the above estimates for t, 

< , ^ 0 as n —)■ cx) 

y'^l/d — 

and this completes the proof. • 

The following simple consequence is also contained in Theorem 4.7 of |9]. 

Corollary 5.3 The chromatic number of a random interval graph satisfies 

p(x(G) = | + o(n)) =1, 

Informally: it is usually approximately n/2. 

Proof. Interval graphs are perfect, and all perfect graphs have uj{G) = 
x(G), as we saw in our essay |2]. The result now follows from the previous 
theorem. • 
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Here is a comparison of the result with what happens for G(n, 2/3), though 
we do not give the proof. It turns out that the clique number of G(n, 2/3) is 
(with probability tending to 1 as n oo) about 21 og 3 / 2 (^), which of course 
is far smaller than for the random interval graphs. Also, in Erdds-Renyi, 
the chromatic number is (again, with probability tending to 1 as u —)■ cx)) 
about ?7,/(2 log 3 (n)), which is of course much larger than the clique number. 
In particular, the Erdds-Renyi graph is very far from being a perfect graph, 
since the chromatic number is (for large n) so much larger than the clique 
number. Note that in both the case of the clique number and the case of 
hte chromatic number, we get noticeably larger answers for random inter¬ 
val graphs than we do for Erdds-Renyi graphs. We refer to [1] for detailed 
statements of these two results. 

Here we present a reformulation from [3] of the result we have just proved 
about the clique number. 


Theorem 5.4 Let the random variable An denotes the size of the largest set 
of pairwise intersecting intervals in a family of n random intervals. There 
exists a function f{n) such that: 


lim 

n^oo 


fin) 


n 


77 / 77 / 

and Jim P(- - f{n) <An<- + fin)) 


0 

1 
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5.2 Independent sets in random interval graphs 

Recall from j2] that an independent set in a random interval graph is the same 
as a chain in the random interval order associated with it. We thus investigate 
the length of the longest chain in the partial order, so as to obtain a result on 
the size of the largest independent set. Let a graph with n vertices where the 
set of edges is equal to the emptyset, then the corresponding random interval 
graph consists of n disjoint intervals. Thus, the size of the maximum chain is 
equal to n. The ordered intervals form a chain, in the sense that they have a 
null intersection. On the other hand, in a complete graph where all the edges 
are existent, the chain is equal to the null set, as all the intervals intersect. 
Again, this material is based on |3] sharpening results in jS]. Recalling from 
[2] a partially ordered set is a non-empty set, which has the mathematical 
property of order. Taking randomly two elements, say x and y, then if we 
have X < y or y < X we define these elements as being comparable forming a 
chain. In other case, we define them as incomparable, forming an antichain. 
In the case of a random interval order, where for two intervals [a, h\ and [c, d], 
we say [a, h\ -< [c, d] if 6 < c. Thus an independent set in the random interval 
graph corresponds to a set of non-intersecting intervals. 

Theorem 5.5 Let denote the maximum number of pairwise disjoint in¬ 
tervals in a family of n random intervals. Then, 



in probability. 
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The proof of this Theorem comes from |3]. 


Proof. We generate a Poisson process with intensity 1 in the upper right 
quadrant. Thus, since the probability function for a discrete Random Vari¬ 
able X, which follows the Poisson distribution is where A is the ex- 

pected value of X, the probability that, for any positive number s, the region 
{{x,y) : 0 < x,y < s} does not contain any point of the process is equal to 
e . 

We are going to choose an infinite chain of points of the process. Let C = 
(£ 2 ,^ 2 ),...}, the points being chosen as follows. Let be the 

point that minimizes max{£i,Mi} and thereafter {ik,Uk) is the point above 
{ik-i,Uk-i) that minimizes ma.x{ik,Uk}- Thus the points of the chain are 
chosen subject to the restrictions that they are monotonically increasing and 
the difference between every two ‘neighboring ’points of the chain minimum. 
Thinking of each point in the chain as defining an interval, it is easy to see 
that thus this chain is built up from the bottom by always choosing the next 
interval to be the one with least possible upper endpoint. It is not hard 
to check by induction the intuitively reasonable claim that, in any finite 
collection of intervals, this chain will have the maximum possible length. 

Then, if S' is a variable, whose value is ma.x{xi,yi) the mass density 
function of S is given by: 

/(s) = L(1 _ = 2se-^ 
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This is because we have 


P(max(a;i,yi) < s) = 1 — P{xi > s and yi > s) 

= 1 — P(the square (0, 0), (0, s), (s, 0) and (s, s) contains no point of the process) 

2 

= 1 — e~^ as observed above, using that the intensity is 1 

Thus F{s) = P{S < s) = 1 — . Thus its density /(s) is the derivative of 

this with respect to s, which is indeed as stated. 

Therefore, we have 


r°° 2 

EiS) = / 2se ^ s.ds 
Jo 


t^e dt 


= r(3/2)= 


TT 


We now claim the differences 

Xi = max(£i, til)— 0 , X 2 = max(f' 2 , n 2 )—max(f'i, ui), X 3 = max(f' 3 , M 3 )—max(f' 2 , ^ 2 ), • • • 

are independent and identically distributed with mean To see this, we 
have just proved this for the first difference. Now we, so to speak, move 
the origin to and use the homogeneity of the Poisson process to get 

that the variables are identically distributed. Independence follows from the 
independence properties of the Poisson process. 
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Therefore, by the Law of Large Numbers, m for any e > 0 sufficiently 
small. 



just by simplifying the telescoping sum in the dehnition of the Xj. 

Let r(n) denotes the minimum r, such the area [0,r]^ contains exactly 
n points of the Poisson Process. Then these points determine n random 
intervals. (Conditional on the number of points of a Poisson process in a 
certain area being given, the points themselves are uniformly distributed 
over that area). 

Recall that we are studying the size of the largest independent set in a 
random interval graph with n intervals, that is the longest chain of intervals 
in the corresponding interval order. Thus the above remarks show that we 
can identify with the largest m such lies in the area [0,r(n)]^. 

Now, because the Poisson process has density 1, we have 


lini P (^(1 — e)^/n < i{n) < (1 + e)\/^) = 1 


using the Law of Large Numbers m- Thus, if now we let 


nil = [(1 — ^){^=)\/n\ and 
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we see by the previous results that, for n sufficiently large, the point Umi) 


will he inside the square [0,r(?7,)]^ and the point {im 2 ,Um 2 ) will lie outside of 
this area, with probability tending to 1. Thus we indeed get 



which completes the proof. • 

We again compare this with the result for G{n, 2/3). Here it turns out that 
the independence number is, with probability tending to 1 as n ^ cxo, about 
21 og3(n). Again we refer to [1] for a proof of this fact. Note again that this 
number is much smaller in the G{n, 2/3) than in the random interval graph. 

5.3 Comparison of different models 

Here we discuss the differences, which arise in two different models of ran¬ 
dom graphs, namely the Erdos-Renyi model and the random interval graphs. 
In the hrst model, the possibility of an edge arising is a constant equal to 
2/3, independent from the number of edges. Moreover from j9j we present 
other features of random interval graphs, as the value of minimum, maxi¬ 
mum degree for the two models, chromatic and independence numbers. As 
we previously saw, for the Erdds-Renyi model the minimum and maximum 
degrees are |n — o(n) and |n -|- o(n) respectively and all the degrees are 
roughly close to 2n/3. On the other hand, from [9] we have that the min¬ 
imum degree is equal to 0{y/n) and the maximum degree is about n — 1 
for the ordinary model. Moreover chromatic and independence numbers are 
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^(logn) logarithmic, O(logn) for Erdos-Renyi model, see j^j. Also, the 
clique number is equal to independence number. Furthermore, in this model 
random graphs are not perfect, as the chromatic number is not equal with the 
clique number. In random interval graphs chromatic number and clique num¬ 
ber is about to |, as we saw in above sections. Finally, from j9] independence 
number is equal to 0{y/n). This comparison between these models clearly 
shows the differences arising in their characteristic values, which determine 
their properties. 
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6 Variants 


6.1 Introduction 

In this rather miscellaneous section, we discuss an alternative way to de- 
hne random interval graphs, a recent generalization by Scheinerman of these 
graphs and some applications of them. 

6.2 Scheinerman’s generalization 

Scheinerman has recently introduced a common generalization of both ran¬ 
dom interval graphs and the Erdos-Renyi graphs, namely random dot product 
graphs. Also from dni he gives various dehnitions of interval graphs. 

To understand Scheinerman’s idea, we hrst introduce Intersection Graphs. 
Suppose we have a hnite set of n vertices, Vn- At each vertex v G I4i, we 
have a subset St, C R (here, as usual, R is the set of the real numbers). We 
now say that two vertices are adjacent if and only if the corresponding sets 
have a non-null intersection. In mathematical notation 

V ^ w n 7 ^ 0. 

So an interval graph is a special kind of intersection graph, with the set Sy 
for each vertex v being an interval of the real line. 

Moreover, in dni random intersection graphs are introduced, by assigning 
randomly sets Sy to the vertices, and then we again say that two vertices 
are adjacent if their corresponding sets intersect. The usual way to assign 
these sets is to say that each Sy is a subset of {1, 2 ,... fc} with, for each v, 
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P{i G S^) = p say, independently for 1 < i < k and each vertex choosing its 
subset independently. However there are other possibilities. 

A further model studied in are Threshold Graphs. Here, For every 

vertex v , we assign a number Then, two vertices intersect if and only 

if the sum of the corresponding numbers is > 1. Again, in mathematical 
notation, + > 1. Again we can have random threshold 

graphs by generating the in some random way. 

The main business of is to give a new dehnition of a model of random 
graphs which combines all these dehnitions, by using dot products. Here, 
vertex v is assigned a d-dimensional vector of real numbers X^. Then, two 
vertices are adjacent, if the corresponding inner product of the vectors is > 1. 
Mathematically, 

V ^ w • Xu, > 1 

where • denotes inner (dot) product: 

d 

{xi,X2, ...Xd)» ( 2 / 1 , 2 / 2 , ■■■yd) = ^XiPi. 

i=l 

The idea behind the dehnition of random dot products is that various 
ways of dehning random interval graphs can be replaced by the random 
dot product. Indeed, we have that each vertex i is randomly assigned a 
d-dimensional vector Xj. Here d G N is hxed. The vectors themselves 
can be generated from some d-dimensional distribution; this could be each 
component chosen independently, but there are other possibilities as well. We 
now say that i ~ j with probability /(Xj • Xj) for some hxed, and carefully 
chosen, function /. 
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This is a general definition, which generalizes several of the definitions 
above: 

Erdds-Renyi graphs: generalized because if we take, for every vertex n, 
Xy = X = (x, X,... x) where x • x = p, and f(x) = x, a moment’s thought 
will show that we recover the G{n,p) model. 

Random intersection graphs with each Sy C {1,2,... k}: because, if we 
take Xi to be the vector whose jth component is 1 if j G S'^,, and whose 
jth component is 0 otherwise, then the property that two vertices v and w 
are adjacent if and only if n 7^ 0 can be written as the property that 
V w with probability f{Xy*Xyo), where /(f) is 0 if f = 0 and is 1 otherwise. 

(Note: Observant readers will have observed that this is only a general¬ 
ization, in the strict sense, of the random intersection graphs in the case 
when each Sy C {1,2.. .k], whereas of course to get intersection graphs to 
generalize interval graphs we have to have the Sy being infinite sets, namely 
certain intervals of the real line. However it is certainly a generalization in 
spirit of the idea). 

An attractive feature of this very general definition is that we can com¬ 
bine random and non-random ideas in giving the definitions of the vectors, 
according to the situation we are working in. 

Scheinerman dj] starts by giving some results for the case when d = 1 
and the ‘vectors’ (really, in this case, scalars, so we will denote them by the 
lower case letter) Xi are uniformly distributed on [0,1]. He takes /(f) = G 
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for some fixed r; these assumptions will remain in force throughout this 
paragraph. Now we have 


P[i ~ j] 


/( 


r-l /•! 


Xj • Xi) = 


10 Jo 


{XiXj 


''dxidxj 


1 

(1 + r )2 


since this is the average, over all possible values of Xi and Xj, of f(xiXj). This 
is, just by simple integrations. 


nl nl 

x\dxi / 


} Xi={) 


x^jdxj 


X 


r+l 


V + 1 


1 

Xi=0 


. X, 


r+l 




r + l 


r + l 


(1 + r)2 

Thus, the expected number of edges is (1 + r)“^, since it is the ex¬ 

pectation of a sum of n{n — l)/2 indicator variables of whether each edge 
is present, each indicator having expectation 1/(1 + r)^. He also presents a 
short calculation, the details of which we omit, showing that if a ~ 6 and 
& ~ c, then conditional on this information P{a ~ c) is larger than it would 
be unconditionally: that is, there is a ‘clustering’effect. He believes, but 
cannot at present prove, that the degrees in the graph follow a ‘power law’; 
that is, letting N{d) denote the number of vertices of degree d, a plot of 
log(iVrf) against log((i) should be a roughly straight line with negative gra¬ 
dient. Scheinerman observes that various large networks arising in real life 
have been observed to have this property (at least roughly). He further ob- 
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tains that the expected number of isolated vertices is for a suitable 

constant > 0. In particular, these graphs are not connected: however, 
they do have a very large component and a few isolated vertices. Further 
they have diameter at most 6 . (The diameter of a graph is the worst case of 
the distance between two points in it). 

Moreover, Scheinerman m introduces the ‘inverse problem’. Given a 
graph on a specihc set of vertices, which vectors are more suitable to model 
his graph? An obvious approach is to say that the best choice of Xs are 
those which maximize the likelihood function. This is doable in dimension 
1 , though in higher dimensions it becomes very unpleasant fairly rapidly. 
He thus suggests an alternative approach based on matrix theory, the Gram 
Matrix Approach. In detail: given Gi, G 2 ■ ■ ■ Gm ^ = m Si=i 

^ P[i^ j]= XiXj{i 7 ^ j) 

X = [Xi,X2 ...,Xn] 

A = A*A 


6.3 Prisner’s definition 

In [7], E. Prisner proposes the following question: 

‘What other reasonable models, apart from Scheinerman’s, are there for ran¬ 
dom interval graphs? For example, suppose we choose n unit intervals (i.e. 
intervals of length 1 ) which are chosen from the interval [ 0 , m] and are chosen 
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uniformly and at random? 


We are not aware of any substantial work on this model. 

6.4 Applications of Random Interval Graphs 

The obvious applicability of random interval graphs is to scheduling and 
assignment problems. For example, suppose each of n people in an office has 
an interval each day when he is free for a meeting. The exact size of this 
interval, and its position in the period of the working day (late, or early, or 
whatever) will vary from day to day, so can be modeled as random. Then, 
if we have a random interval graph whose vertices are the n individuals and 
where two vertices (individuals) are adjacent if and only if their random 
intervals intersect, then we are saying that these two individuals will have 
a chance to meet on that day: and, for example, the largest clique in the 
random interval graph will be the largest number of people who can all 
meet. Similarly, the largest independent set will be the largest set of people, 
no two of whom can meet. Of course, the assumption that the intervals are 
uniformly distributed over the working day is probably not very realistic: 
for example, most people will be unavailable for some time over lunch, and 
in practice they will have several time intervals at which they are available, 
rather than just one. (T am available between 9.00 and 10.00, and between 
2.00 and 3.00’). However it is a reasonable hrst model. 

Similarly, if we have a series of jobs to carry out in a factory. Suppose for 
example we are making a car or similar. Various tasks - say, n of them - have 
to be carried out during the production process (for example: painting the 
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outside, installing the radio, checking the braking system, etc.), and usually 
we cannot be doing more than one of these things at once. There will be 
time intervals during a working day in which the people qualified to carry 
out the various tasks (brake testing, painting etc.) are available: again, these 
will be hard to predict in advance, so can be modeled as random. Again, we 
will then want to have a large independent set in the graph, as that means 
we have the corresponding time intervals are disjoint, so there are no clashes. 
That is, by doing one of the jobs in its time interval, we are not reducing our 
chance of getting one of the other jobs done that day. (If the painter and the 
radio installer are available in disjoint time intervals, then we know that we 
can just get on with doing the painting and this will not reduce our chances 
of getting the radio installed that day as well). 

Moreover, from my essay, [2] interval graphs are widely used in resource 
allocation problems. That is, we want to allocate a fixed amount of assets 
in production activity, in order to maximize profit. This is a problem in the 
field of combinatorial optimization. Another application of interval graphs is 
their usefulness in many problems of this field of discrete Mathematics. An 
example is the traveling salesman’s problem. A salesman leaves his home 
and he is willing to visit n towns. He then has to consider n\ alternative 
feasible tours. We want to find the optimal tour, so as to visit each town 
only once and to minimize the relevant cost of traveling. Obviously, this is a 
challenging problem, as the set of feasible tours is too vast. (For example, if 
he has to visit 4 towns, the number of feasible tours is 24. If he has to visit 
say 6 towns, then we have to consider 6!=720 different tours!). Consider, to 
each town we assign a vertex. Then, two vertices are adjacent if the salesman 
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leaves the town and goes to the other. Then, to each vertex let’s assign an 
interval. Clearly, if two intervals intersect, there is an edge arising. But, it 
is a cost associated to the salesman tour. So, we aim to find the tour having 
minimum cost, so all the n towns will be visited. Also interval graphs have 
many other applications, as we shall see. 

The textbook by McKee and McMorris [S] in Chapter 3 contains refer¬ 
ences to various applications of interval graphs in Biology, Psychology and 
Computing. In Biology, for example, from |3] a prominent application of In¬ 
terval Graphs is the physical mapping of DNA. From a DNA sequence, some 
fragments, which are called clones are obtained and the goal is to reconstruct 
the placement of the clones: that is, where they are on the DNA string. Thus 
a clone is an interval of a line of DNA. 

To turn this into a problem about random interval graphs, we assign to 
each clone a vertex. Two different clones are adjacent if and only if their 
corresponding intervals intersect. This clearly gives an interval graph. 

Also, interval graphs are used in social sciences. For example, in Psychol¬ 
ogy, m they are widely used as tools, measuring notions, which determine 
different psychological theories. (Most theory of measurement is based on 
physical science: however, in the social sciences, different theories may be 
more appropriate). 

An example from jS] is that a person has a set A of alternatives solutions 
to choose. For simplicity, suppose that elements of A are different makes 
of cars. Our person has preferences among the the different makes of cars: 
for example, he might prefer stylish cars or cars having low cost of service, 
etc). Then, a real-valued function / on the set A, such that for a,b E A, 
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he prefers alternative a than b when /(a) > f{b) + 6, where 5 is a positive 
constant representing a threshold - a ‘just noticeable’ difference between the 
two kinds of cars. Then, we dehne a binary relation i? on A to be an interval 
order on the set A of alternatives, if it satishes the two following axioms. 
(We are thinking here of aRb as meaning that a is preferable to b). 

Axiom 1; For all a G v4, not aRa 

Axiom 2; For all a, b,c,d ^ A, if aRb and cRd, then either aRd or cRb 

The motivation is that a car is not preferable to itself (clearly), which gives 
Axiom 1. Similarly, if a is preferable to b and c preferable to d, it seems 
reasonable that at least one of a is preferable to d and c is preferable to c 
should hold. 

Furthermore, interval graphs are used in Computing. They are used in 
scheduling problems. From [5] we have an interesting application of this 
class of problems. Suppose that we want to hnd an arrangement, in order to 
construct a timetable for different courses in a University. We have a fixed 
number of rooms available for teaching purposes and we know the number 
of teachers. We aim to construct an efficient timetable, so to be no over¬ 
lap between teaching hours for every lesson. We assign various courses to 
vertices. Then, two vertices are adjacent, when the corresponding intervals 
intersect. When this is the case, we have two different courses at the same 
hour. Thus, we want to hnd the minimum number of rooms needed, in or¬ 
der all the courses to be taught. In "graph language" we want to hnd the 
chromatic number. That is the minimum number of colors needed, so two 
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connected vertices have different colors, [Hj. Moreover, in problems related 
to information retrieval, we use interval graphs. Suppose that $ denotes set 
of files, which contain information and Q is the set of queries for retrieving 
information. Then, $ and Q satisfy the consecutive retrieval property if the 
files relevant to each query can be stored consecutively in a linear form, so not 
to be overlap, |1]. We easily deduce from the above the extensive use of inter¬ 
val graphs in various, different sciences, from areas of applied Mathematics 
to social sciences, as Psychology. 
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7 Conclusions: areas for further work 

In this project, we examined Random Interval Graphs, emphasizing Schein- 
erman’s dehnition. We presented some resnlts on the nnmber of edges and 
then considered the degrees of vertices in the graphs, observing that these 
are much more spread out that in the alternative Erdds-Renyi model of ran¬ 
dom graphs: for example, the maximum degree of a random interval graph 
is very likely to be close to n — 1, and indeed is equal to n — 1 with prob¬ 
ability 2/3. We then considered cliques, independent sets and chromatic 
numbers of random interval graphs, obtaining asymptotic estimates of each 
of the quantities involved and comparing their values with the values in the 
Erdds-Renyi model. Finally, we wrote about other ways of defining Random 
Interval Graphs, such as Scheinerman’s random dot product graphs, empha¬ 
sizing the 1-dimensional case of Scheinerman’s theory. Finally we outlined 
some areas of application. 

There are some questions left unanswered by our work. For example, it 
would be desirable to investigate measures of connectivity (such as vertex- 
connectivity or edge-connectivity) in random interval graphs. Also; what is 
the diameter of a random interval graph? The obvious guess would be that 
it is 2, since if we have two vertices v and w, one would hope that one of 
the many vertices of high degree (close to n — 1) will be adjacent to both of 
them. Gertainly the probability that the diameter is 2 is at least 2/3, since if 
there is a vertex of degree n — 1 is till be adjacent to both v and w (if either 
n or tc is a vertex of degree n — 1, then it is clearly adjacent to the other). 
The result that the probability is at least 2/3 is now just a consequence of 
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the fact that with probability 2/3 there is a vertex of degree n — 1. However 
the guess that the diameter of a random interval graph is 2 with probability 
tending to 1 does not seem to follow immediately from what we have proven, 
because it is not quite clear that we can avoid the situation where there are 
two vertices v and w of low degree and all the vertices of high degree fail to 
be adjacent to at least one of v and w. 

Another topic is the existence of Hamilton cycles in a random interval 
graph. Scheinerman ([S]) shows, by a rather long and difficult argument, 
that with probability tending to 1, a random interval graph is Hamiltonian; 
that is, it has a cycle which passes through every vertex of the graph. A 
G{n, 2/3) is also Hamiltonian: indeed G{n, 2/3) has the stronger property 
that it has (with probability tending to 1) \_5{G)/2\ edge-disjoint Hamilton 
cycles: we refer to [1] for a proof of this fact. (Two cycles are edge-disjoint if 
and only if there is no edge which is in both cycles). Note that \6{G)/2\ is 
the largest number of edge disjoint Hamilton cycles we could have in a graph 
G, because each Hamilton cycle will use up two edges in passing through a 
vertex v of degree 5{G). Is it true that a random interval graph has [5{G)/2\ 
edge-disjoint Hamilton cycles? Again this does not seem to be obvious. 

We hope that we have given a reasonable material of random interval 
graphs, some interesting results on them, and some ideas of how they might 
be useful. 
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